Isolated Mandarin Syllable Recognition with Limited Training Data Specially Considering the Effect o - Speech and Audio Processing, IEEE Transactions on
Abstract
This correspondence proposes a set of new approaches for modeling Mandarin syllables so that they can be recognized accurately from limited training data, with special attention to the effect of tones. The approaches include improved initial values and state transition topologies, and use of the durational cue. The results show that these approaches are useful in practice.
Related papers
Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary
This paper describes the first successfully implemented real-time Mandarin dictation machine developed in the world, which recognizes Mandarin speech with a very large vocabulary and almost unlimited texts for the input of Chinese characters into computers. Considering the special characteristics of the Chinese language, syllables are chosen as the basic units for dictation. The machine is ...
Tone recognition of continuous Mandarin speech based on neural networks
Several neural network-based tone recognition schemes for continuous Mandarin speech are discussed. A basic MLP tone recognizer using recognition features extracted from the processing syllable is first introduced. Then, some additional features extracted from neighboring syllables are added to compensate for the coarticulation effect. It is then further improved to compensate for the effect of...
Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and more important. In this paper, considering the monosyllabic structure of the Chinese language, a whole class of syllable-based indexing features, including overlapping segments of syllables and syllable pairs separated ...
Classification of Thai Tone Sequences in Syllable-Segmented Speech Using the Analysis-by-Synthesis M - Speech and Audio Processing, IEEE Transactions on
Tone classification is important for Thai speech recognition because tone affects the lexical identification of words. An analysis-by-synthesis algorithm for classifying Thai tones in syllable-segmented speech is presented that uses an extension to Fujisaki’s model for tone languages that incorporates tonal assimilation and declination. The classifier correctly identifies all of the tones in 89....
A modular RNN-based method for continuous Mandarin speech recognition
A new modular recurrent neural network (MRNN)-based method for continuous Mandarin speech recognition (CMSR) is proposed. The MRNN recognizer is composed of four main modules. The first is a sub-MRNN module whose function is to generate discriminant functions for all 412 base-syllables. It accomplishes the task by using four recurrent neural network (RNN) submodules. The second is an RNN module...